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Abstract— This paper considers three kinds of length se- 
quences of the complete genome. Detrended fluctuation anal- 
ysis, spectral analysis, and the mean distance spanned within 
time L axe used to discuss the correlation property of these 
sequences. The values of the exponents from these methods of 
these three kinds of length sequences of bacteria indicate that 
the long-range correlations exist in most of these sequences. 
The correlation have a rich variety of behaviours including 
the presence of anti-correlations. Further more, using the 
exponent 7, it is found that these correlations are all linear 
(7 = 1.0 ± 0.03). It is also found that these sequences exhibit 
1// noise in some interval of frequency (/ > 1). The length 
of this interval of frequency depends on the length of the se- 
quence. The shape of the periodogram in / > 1 exhibits some 
periodicity. The period seems to depend on the length and 
the complexity of the length sequence. 

PACS numbers; 87.10+e, 47.53+n 

Key words; Coding/noncoding segments, length sequence, 
complete genome, detrended fluctuation analysis, 1// noise. 



I. INTRODUCTION 

Recently, there has been considerable interest in 
the finding of long-range eprrelation (LRC) in DNA se- 
quences [1 — 16]. Li et al EJ found that the spectral den- 
sity of a DNA sequence containing mostly introns shows 
1//^ behaviour, which indicates the presence of LRC. 
The correlation properties of coding and nancoding DNA 
sequences were first studied by Peng et al B in their frac- 
tal landscape or DNA walk model. The DNA walk de- 
fined in is that the walker steps "up" if a pyrimi- 
dine (C or T) occurs at position i along the DNA chain, 
while the walker steps "down" if a purine {A or G) occurs 
at position i. Peng et al a discovered that there exists 
LRC in noncoding DNA sequences while the coding se- 
quences correspond to a regular random walk. By doing a 
more detaile(i,analysis, Chatzidimitriou, Dreismann and 
Larhammar concluded that both coding and noncod- 



* Corresponding author, e-mail; yuzg@hotmail.com or 
z.yu@qut.edu.au 
^This is the permanent corresponding address of Zu-Guo Yu. 



ing sequences ^hibit LRC. A subsequent work by Prabhu 
and Claverie 13 also substantially corroborates these re- 
sults. If one considers more details by distinguishing C 
from T in pyrimidine, and A from G in purine (such as 
two or three-dimensional DNA walk model O and maps 
given in then the presence of base correlation has 
been found even in coding sequences. In view of the con- 
troversy about the presence of correlation ij3-iall DNA 
or only in noncoding DNA, Buldyrev et al M showed 
the LRC appears mainly in noncoding DNA usingpaJl 
the DNA sequences available. Alternatively, Voss tHl, 
based on equal-symbol correlation, showed a power law 
behaviour for the sequences studied regardless of the per- 
cent of intron contents. Investigations based on different 
models seem to suggest different results, as they all look 
into only a certain aspect of the entire DNA sequence. It 
is therefore important to investigate the degree of corre- 
lations in a model-independent way. 

Since the first complete genome of the free-living 
bactepojm Mycoplasma genitalium was sequenced in 
1995 II3I , an ever-growing number of complete genomes 
has been deposited in public databases. The availabil- 
ity of complete genomes induces the possibility to ask 
some global questions on these sequences. The avoided 
and under-represented strings in some bacterial complete 
genomes have been discussed in [p^|-pO[. A time series 
model of CDS in complete genome has, also been pro- 
posed in |2^. Maria de Sousa Vieira H have done the 
low-frequency analysis of complete DNA of 13 micro- 
bial genomes and shown that its fractal behaviour not 
always prevails through the entire chain, the autocorre- 
lation function have a rich variety of behaviours including 
the presence of anti-correlations. 

For the importance of the numbers, sizes and order- 
ing of genes along the chromosome, one can refer to Part 
5 of the famous book of Lewin (Ref. |23j). Hence one 
may ignore the composition of the four kinds of bases in 
coding and noncoding segments and only considers the 
rough structure of the complete genepie or long DNA se- 
quences. Provata and Almirantis EJ proposed a fractal 
Cantor pattern of DNA. They map coding segments to 
filled regions and noncoding segments to empty regions 
of random Cantor set and then calculate the fractal di- 
mension of the random fractal set. They found that the 
coding/noncoding partition in DNA sequences of lower 
organisms is homogeneous-like, while in the higher eucar- 
iotes the partition is fractal. This result seems too rough 
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to distinguish bacteria because the fractal dimensions of 
bacteria they gave out are all the same. The classification 
and evolution relationship of bacteria is one of the most 
important problem in DNA research. Yu and Anh c3 
proposed a time series model based on the global struc- 
ture of the complete genome and considered three kinds 
of length sequences. After calculating the correlation di- 
mensions and Hurst exponents, it was found that one 
can get more information from this model than that of 
fractal Cantor pattern. Some results on the classification 
and evolution relationship of bacteria were found in [ p5| . 
Naturally it is desirable to know if there exists LRC in 
these length sequences. The quantification of these cor- 
relations could give an insight of the role of the ordering 
of genes on the chromosome, which is far to be irrelevant 
for gene function. We attempt to answer this question in 
this paper. 

Viewing from the level of structure, the complete 
genome of an organism is made up of coding and non- 
coding segments. Here the length of a coding/noncoding 
segment means the number of its bases. Based on the 
lengths of coding/noncoding segments in the complete 
genome, we can get three kinds of integer sequences by 
the following ways. 

i) First we order all lengths of coding and noncoding 
segments according to the order of coding and noncod- 
ing segments in the complete genome, then replace the 
lengths of noncoding segments by their negative num- 
bers. This allows to distinguish lengths of coding and 
noncoding segments. This integer sequence is named 
whole length sequence. 

ii) We order all lengths of coding segments according 
to the order of coding segments in the complete genome. 
We name this integer sequence coding length sequence. 

iii) We order all lengths of noncoding segments accord- 
ing to the order of noncoding segments in the complete 
genome. This integer sequence is named noncoding length 
sequence. 

We can now view these three kinds of integer sequences 
as time series. In the following, we will investigate 
the correlation pmperty through Detrended Fluctuation 
Analysis (DFA) O and spectral analysis. 



II. DETRENDED FLUCTUATION ANALYSIS 
AND SPECTRAL ANALYSIS 

We denote a time series as X{t)^t = 1, - ■ ■ ,N. First 
the time series is integrated as y{k) — J2t=ii-^i^)~-^ave], 
where Xave is the average over the whole time period. 
Next, the integrated time series is divided into boxes of 
equal length, n. In each box of length n, a least-squares 
line is fit to the data, representing the trend in that box. 
The y coordinate of the straight line segments is denoted 
by yn{k). We then detrend the integrated time series, 
y{k), by subtracting the local trend, yn{k), in each box. 
The root-mean-square fluctuation of this integrated and 



detrended time series is calculated as 
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Typically, F(n) will increase with box size n. A linear 
relationship on a double log graph indicates the presence 
of scaling 
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Under such conditions, the fluctuations can be charac- 
terised by the scaling exponent a, the slope of the line 
relating In i^(n) to In n. For uncorrelated data, the in- 
tegrated value, y{k) corresponds to a random walk, and 
therefore, a — 0.5. An value of 0.5 < a < 1.0 indicates 
the presence of LRC so that a large interval is more likely 
to be followed by a large interval and vice versa. In con- 
trast, < a < 0.5 indicates a different type of power-law 
correlations such that large and small values of time se- 
ries are more likely to alternate. For examples, we give 
the DFA of the coding length sequence of A. aeolicus in 
FIG. |l|. 

Now we analyse the time series using the quantity 
M{L), the mean distance-a. walker spanned within time 
L. Dunki and Ambuhl O'Efl used this quantity to discuss 
the scaling property in temporal patterns of schizophre- 
nia. Denote 
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from which we get the walks 



M{L) \W{j) - W{j + L)\ >„ (4) 

where < >j denotes the average over j, and j — 
1,---,N — L. The time shift L typically varies from 
1, ■ • ■ , N/2. From a physics viewpoint, M{L) might be 
thought of as the variance evolution of a random walker's 
total displacement mapped from the time series X[t). 
Af (L) may be assessed for LRC (e.g. M(L) oc L"', 
a' — 1/2 corresponding to the random case). We give 
some examples to estimate the scale parameter a' in FIG. 

Dunki et al proposed the following scale which 
seems to perform better than the scale a' . The definition 



W'ij) :=£|A(<)~A, 



leads to 



M'iL) :=< \W'ij)-W'ij+L)\ >, 



(5) 
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Analyses of test time series showed that are more 
robust against distortion or discretization of the corre- 
sponding amplitudes X[t) than (|^). From the ln(L) v.s. 
ln(Af' (L)) plane, we find the relation 
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The exponent 7 measures only the presence of non- 
hnear correlations and remains equal to unity for all se- 
quences with only linear correlations. We carried out this 
kind of analysis on coding length sequences of A. aeoli- 
cus, B. burgdorferi and T. maritima. The results are 
reported in the left figure of FIG. ||. ,_. 

We also consider the discrete Fourier transform & of 
the time series X(t),t = 1, ■ ■ ■ , N defined by 



X{f) = N-i + l)e"'"'^*, 



then 



S{f) = \X{f)\ 



(8) 
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9ectrum of X (t). In recent studies, it has 
^" that many natural phenomena lead to 



is the power 
been found 

the power spectrum of the form 1/ f^. This kind of de- 
pendence was named 1// noise, in contrast to white noise 
S{f) = const, i.e. /3 = 0. Let the frequency / take k val- 
ues fk = k/N, fc = 1, • • • , iV. From the ln(/) v.s. ln(S'(/)) 
graph, the existence of l/f^ doesn't seem apparent. For 
example, we give the figure of the coding length sequence 
of A. aeolicus on the right of FIG. 

When we use the least squares line to fit data, we need 
to consider the errors. If the data are {(a;;, j/i)}]™, , we 
can define the coefficient of linear correlation as E3 



Er=i(^» -x){yi -y) 



(10) 



where x and y are the average of the values {xi\^^i and 
respectively. If r = ±1, then the points lie ex- 
actly on a straight line; that is, there is a perfect linear 
relationship between x and y. If r = 0, there is no linear 
relationship. The quantity r measures the strength of 
linear relationships between x and y. The values of r in 
figures of obtaining exponents a, a' and (3 are 0.987685, 
0.9949939 and 3.7918E-03 respectively. Hence we can see 
the informations given by exponents a and a' are more 
convincing than that given by exponent (3. 



III. DATA AND RESULTS. 

More than 21 bacterial complete genomes are 
now available in public databases. There are five Ar- 
chaebacteria: Archaeoglobus fulgidus (aful), Pyrococ- 
cus abyssi (pabyssi), Methanococcus jannaschii (mjan), 
Aeropyrum pernix (aero) and Methanobacterium ther- 
moautotrophicum (mthe); four Gram-positive Eubacte- 
ria: Mycobacterium tuberculosis (mtub), Mycoplasma 
pneumoniae (mpneu), Mycoplasma genitalium (mgen). 



and Bacillus subtilis (bsub). The others are Gram- 
negative Eubacteria. These consist of two Hyperther- 
mophilic bacteria: Aquifex aeolicus (aquae) and Ther- 
motoga maritima (tmar); six Proteobacteria: Rhizo- 
bium sp. NGR234 (pNGR234), Escherichia coli (ecoh), 
Haemophilus influenzae (hinf), Helicobacter pylori J99 
(hpyl99), Helicobacter pylori 26695 (hpyl) and Rickettsia 
prowazekii (rpxx) ; two Chlamydia: Chlamydia trachoma- 
tis (ctra) and Chlamydia pneumoniae (cpneu), and two 
Spirochaete: Borrelia burgdorferi (bbur) and Treponema 
pallidum (tpal). 

We calculate scales a, a', of low frequencies (/ < 1) 
and 7 of three kinds of length sequences of the above 21 
bacteria. The estimated results are given in Table | ( 
where we denote by Uwhoie, c^cod and ctnoncod the scales 
of DFA of the whole, coding and noncoding length se- 
quences, from top to bottom, in the increasing order of 
the value of awhoie ), Table || ( where we denote by 
C'whoie^ "cod and a;„„^„rf the scales of M{L) of the whole, 
coding and noncoding length sequences, from top to bot- 
tom, in the increasing order of the value of cx'^hoie ) and 
Table |l| ( where we denote by Pwhoie, Pcod and Pnoncod 
the scales of spectral analysis of the whole, coding and 
noncoding length sequences, from top to bottom, in the 
decreasing order of the value of Pwhoie', we denote by 
Iwhoie, Icod and ^noncod the scales of 7 of the whole, cod- 
ing and noncoding length sequences). 

From the right figure of FIG. || it is seen that S{f) 
does not display clear power-law 1// dependence on the 
frequencies when / < 1. Although the meaning of re- 
gion / > 1 of the power spectrum is not clear, whether 
S{f) displays perfect power-law 1// in this region is im- 
portant. When one considers the electrical character- 
istics of polysilicon emitter bipolar transistors, for high 
frequency analog applications the transistor 1// noise is 
also an important parametei}-sincc it can degrade the 
spectral purity of the circuit O . There is also some ev- 
idence that 1// noise spectral density in the low and in 
the high current region have a different physical origion 
(the reader can refer Ref . and reference therein) . We 
want to know if there is another region of frequencies in 
which S{f) displays perfect power-law 1// dependence 
on the frequencies. We carried out the spectral analysis 
for / > 1, and found that S{f) displays almost a perfect 
power-law 1// dependence on the frequencies in some 
interval: 
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We give the results for coding length sequences of M. 
genitalium, A. fulgidus, A. aeolicus and E. coli (their 
lengths are 303, 1538, 891 and 3034 respectively) in FIG. 
^, where we take k values fk = 'ik{k = \,---, 1000) of 
the frequency /. From FIG. |[ it is seen that the length of 
the interval of frequency in which S{f) displays almost 
a perfect power-law 1// depends on the length of the 
length sequence. The shorter sequence corresponds to 
the larger interval. 
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From FIG. one can see that the power spectrum 
exhibit some kind of periodicity. But the period seems 
to depend on the length of the sequence. We also guess 
that the period also depends on the complexity of the 
sequence. To support this conjecture, we got a promoter 
DNA sequence from the gene bank, then replaced Ahy - 
2, C by -1, G by 1 and T by 2 (this map is given in |^); so 
we obtained a sequence on alphabet {—2, —1, 1, 2}. Then 
a subsequences was obtained with the length the same as 
the coding length sequences of A. aeolicus, A. fulgidus 
and M. genitalium (their lengths are 891, 1538 and 303 
respectively). A comparison is given in FIG. but the 
results are not clear-cut. 



IV. DISCUSSION AND CONCLUSIONS 

Although the existence of the archaebacterial urk- 
ingdom has been accepted by many biologists, the clas- 
sification of bacteria is still a matter of controversy . 
The evolutionary relationship of the three primary king- 
doms (i.e. archeabacteria, eubacteria and eukarypte) is 
another crucial problem that remains unresolved Eil . 

From Table |, we can roughly divide bacteria into two 
classes, one class with awhoie less than 0.5, and the other 
with ttwhoie greater than 0.5. All Archaebacteria belong 
to the same class except Pyrococcus abyssi. All Pro- 
teobacteria belong to the same class except E. coli; in 
particular, the closest Proteobacteria Helicobacter pylori 
26695 and Helicobacter pylori J99 group with each other. 
In the class with a^hoie < 0.5, we have acod < a„oncod 
except H. pylori J99 and M. genitalium; but in the other 
class we have acod > a„oncod- 

Using the exponent a', we can also divide bacteria into 
two class as in Table In one class, a'^^^ < a'^oncod- 



In another class, we have a^^^ > a'noncod except Tre- 
ponema pallidum and Borrelia burgdorferi. Two Hyper- 
thermophilic bacteria Aquifex aeolicus and Thermotoga 
maritima group with each other. 

From Tables | and ||, we can see the similar rules as 
above if we use the exponents acod and a^^^. This follows 
the fact that the coding sequences occupy the main part 
of space of the DNA chain of bacteria. This coincides 
with the conclusion of Rcf. IM . 



Although from Table HI, we can see the values of all 
(3 are not far from 0. From FIG.s ^ and ||, one can 
see exponents a and a' are more convincing than the 
exponent p because the error of estimating a and a' us- 
ing the least-squares linear fit is much less than that of 
the exponent /3 (The values of r in figures of obtain- 
ing exponents a, a' and (3 are 0.987685, 0.9949939 and 
3.7918E-03 respectively). From Tables | and ||, we can 
see most of values a and a' are not equal to 0.5, hence we 
can conclude that most of these length sequences exhibit 
long-range correlations. We can also see the correlation 
have a rich variety of behaviours including the presence 
of anti-correlations. Hence the length sequences have a 



same character as the DNA sequences 'cHI . Further more, 
from Table HI, we get 7 = 1.0 ± 0.03. Hence we can 
conclude that the long-range correlations exist in most 
length sequences are linear. 

We find in an interval of frequency (/ > 1), S{f) dis- 
plays perfect power-law 1/f dependence on the frequen- 
cies (see FIG. |) 



S{f) a 



1 

7^- 



The length of the interval of frequency in which S{f) 
displays almost a perfect power-law 1/f depends on the 
length of the length sequence. The shorter sequence cor- 
responds to the larger interval. The shape of the graph 
of power spectrum in / > 1 also exhibits some kind of 
periodicity. The period seems to depend on the length 
and the complexity of the length sequence. 
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TABLE I. Uuihoie, oicod and anoncod of 21 bacteria. 



Bacteria 


Category 


^whole 




^noncod 


Rhizobium sp. NGR234 


Protcobacteria 


0.24759 


0.11158 


0.34342 


Mycoplasma genitalium 


Gram-positive Eubacteria 


0.37003 


0.25374 


0.18111 


Chlamydia trachomatis 


Chlamydia 


0.42251 


0.37043 


0.49373 


Thcrmotoga maritima 


Hyperthermophilic bacteria 


0.43314 


0.47659 


0.49279 


Mycoplasma pneumoniae 


Gram-positive Eubacteria 


0.44304 


0.45208 


0.49922 


Pyrococcus abyssi 


Archacbactcria 


0.48568 


0.39271 


0.42884 


Helicobacter pylori J99 


Protcobacteria 


0.48770 


0.43562 


0.42089 


Helicobacter pylori 26695 


Protcobacteria 


0.49538 


0.37608 


0.41374 


Haemophilus influenzae 


Protcobacteria 


0.49771 


0.42432 


0.53013 


Rickettsia prowazekii 


Protcobacteria 


0.49950 


0.33089 


0.51923 


Chlamydia pneumoniae 


Chlamydia 


0.53982 


0.53615 


0.38085 


Methanococcus jannaschii 


Archaebacteria 


0.54516 


0.58380 


0.34482 


M. tuberculosis 


Gram-positive Eubacteria 


0.55621 


0.57479 


0.52949 


Acropyrum pernix 


Archacbactcria 


0.57817 


0.63248 


0.44829 


Bacillus subtilis 


Gram-positive Eubacteria 


0.58047 


0.59221 


0.54480 


Borrelia burgdorferi 


Spirochaete 


0.58258 


0.53687 


0.51815 


Archacoglobus fulgidus 


Archacbactcria 


0.59558 


0.59025 


0.46596 


Aquifcx acolicus 


Hyperthermophilic bacteria 


0.59558 


0.55964 


0.43141 


Escherichia coli 


Protcobacteria 


0.60469 


0.62011 


0.52000 


M. thermoautotrophicum 


Archaebacteria 


0.62055 


0.64567 


0.38825 


Treponema pallidum 


Spirochaete 


0.67964 


0.70297 


0.60914 



TABLE II. Q^hoje, a^od and a'^oncod of 21 bacteria. 



Bacteria 


Category 


/ 


"cod 


/ 

^noncod 


Rhizobium sp. NGR234 


Protcobacteria 


0.17021 


0.11223 


0.28573 


Chlamydia trachomatis 


Chlamydia 


0.172340 


0.23801 


0.66431 


M. tuberculosis 


Gram-positive Eubacteria 


0.20185 


0.18451 


0.43716 


Mycoplasma genitalium 


Gram-positive Eubacteria 


0.21632 


0.25185 


0.25971 


Escherichia coli 


Protcobacteria 


0.25837 


0.24567 


0.62126 


Pyrococcus abyssi 


Archaebacteria 


0.29809 


0.18061 


0.48169 


Bacillus subtilis 


Gram-positive Eubacteria 


0.36791 


0.46816 


0.55325 


Mycoplasma pneumoniae 


Gram-positive Eubacteria 


0.37148 


0.46475 


0.46829 


Chlamydia pneumoniae 


Chlamydia 


0.37216 


0.26939 


0.50734 


Rickettsia prowazekii 


Protcobacteria 


0.41040 


0.23109 


0.50930 


Archacoglobus fulgidus 


Archaebacteria 


0.43149 


0.35370 


0.60835 


Helicobacter pylori 26695 


Protcobacteria 


0.44082 


0.38500 


0.39325 


Haemophilus influenzae 


Protcobacteria 


0.46121 


0.44842 


0.34842 


Aeropyrum pernix 


Archaebacteria 


0.46203 


0.45520 


0.24850 


M. thermoautotrophicum 


Archaebacteria 


0.48038 


0.48870 


0.36249 


Thcrmotoga maritima 


Hyperthermophilic bacteria 


0.49453 


0.50457 


0.27005 


Aquifcx aeolicus 


Hyperthermophilic bacteria 


0.50237 


0.50582 


0.31488 


Helicobacter pylori J99 


Proteobacteria 


0.54547 


0.50999 


0.48640 


Treponema pallidum 


Spirochaete 


0.56357 


0.56808 


0.65350 


Borrelia burgdorferi 


Spirochaete 


0.61186 


0.58016 


0.61772 


Methanococcus jannaschii 


Archaebacteria 


0.72726 


0.73384 


0.33780 
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TABLE III. Pyjhoie, Pcod and /3noncod; fwhoie, food and jnoncod of 21 bacteria. 



Bacteria 


0-whole 


Pcod 


0noncod 


"ywhole 


Icod 


'~fnoncod 


M. genitalium 


0.05880 


0.02030 


-0.00708 


1.00017 


0.99698 


1.01652 


H. pylori 26695 


0.05026 


-0.01412 


0.01196 


0.99902 


1.00057 


0.99538 


M. jaima.scliii 


0.()lcS5() 


-0.026 10 


-0.12517 


0.99727 


0.99079 


0.99767 


C. pneumoniae 


0.04405 


0.01071 


-0.01906 


0.99998 


1.00099 


0.99348 


A. aeolicus 


0.03152 


0.00811 


-0.00115 


1.00441 


0.99816 


0.99870 


H. pylori J99 


0.01968 


0.04512 


-0.05815 


0.99867 


0.99926 


0.99349 


T. maritima 


0.00737 


-0.02656 


0.01965 


0.99726 


0.99524 


0.98866 


C. trachomatis 


0.00256 


-0.05829 


-0.02549 


0.99767 


1.00211 


0.98553 


R. sp. NGR234 


0.00230 


0.04048 


-0.10905 


1.00570 


0.99612 


1.01515 


M. thermoauto. 


-0.00217 


-0.11916 


0.02079 


1.00479 


1.00171 


1.00063 


T. pallidum 


-0.00422 


-0.02902 


0.09510 


1.01009 


1.01532 


1.00222 


M. pneumoniae 


-0.01137 


0.03437 


-0.05573 


0.98820 


0.98783 


0.97260 


P. abyssi 


-0.01589 


-0.04242 


0.00071 


0.99888 


0.99816 


0.99293 


E. coli 


-0.01917 


-0.05513 


0.01772 


0.99856 


1.00197 


0.98938 


M. tuberculosis 


-0.02653 


-0.05653 


-0.02698 


1 AAA£JO 

l.OOOoi 


0.99974 


"1 AAOA1 

1.00801 


A. pernix 


-0.03882 


0.01648 


-0.09395 


1.00298 


1.00407 


1.00286 


B. burgdorferi 


-0.04420 


-0.05189 


-0.10710 


0.99287 


0.99792 


1.03206 


R. prowazekii 


-0.04884 


-0.12438 


-0.07581 


1.00284 


0.99043 


0.99991 


H. influenzae 


-0.05338 


-0.04853 


-0.04341 


0.99798 


1.00248 


0.98684 


A. fulgidus 


-0.06372 


-0.08130 


-0.00881 


1.00347 


1.00610 


0.98219 


B. subtilis 


-0.06887 


-0.17231 


-0.02380 


0.99629 


1.00853 


0.98666 
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1 2 3 4 5 6 7 -7 -6 -5 -4 -3 -2 -1 

ln(L) ln(f) 
FIG. 3. Left) Estimate the scale 7. Right) An example of spectral analysis of low frequencies / < 1- 
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